Hybrid Implementation and Performance Analysis for High Performance Computation Workload

نویسنده

  • Joseph Issa
چکیده

Given the need to achieve maximum performance possible, offloading intensive computation workload to GPU is a key to achieve this goal. Offloading most of the workload to GPU may not results in desired performance, so a middle approach is more suitable such as splitting the workload between the CPU and the GPU can be considered as an optimized approach. In this study, we used a popular high performance computation workload which can also be implemented using a hybrid approach in which part of the workload is offloaded to the CPU. We also present a performance estimation method which is verified to estimate performance with in 5% error margin.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallelization of the Treecode Algorithm for N-Body Simulation Using MPI, Hybrid, and GridRPC Programming Paradigms

This dissertation describes the parallelization of the treecode algorithm for N-Body problem and performance comparison among three different parallel programming paradigms, MPI, hybrid MPI-OpenMP, and GridRPC. In N-Body simulation, the specific routine for calculating the forces on the bodies which accounts for upwards of 90% of the cycles in typical computations is eminently suitable for obta...

متن کامل

Comparing performance of organization on implementation of customer relationship management systems using ANP and TOPSIS hybrid approach

As the customers are the main reason of the formation and survival of the organization, not only understanding their obvious needs, but also forecasting, determining and guiding their hidden needs, design and implementing plans of offering services for meeting these needs for attracting customers are among cornerstone of any activity in the organization. In this research, one compares the perfo...

متن کامل

A Novel Hybrid-Excited Modular Variable Reluctance Motor for Electric Vehicle Applications: Analysis, Comparison, and Implementation

A variable reluctance machine (VRM) has been proven to be an outstanding candidate for electric vehicle (EV) applications. This paper introduces a new double-stator, 12/14/12-pole three-phase hybrid-excited modular variable reluctance machine (MVRM) for EV applications. In order to demonstrate the superiorities of the proposed structure, the static torque characteristics and dynamic performance...

متن کامل

HPC Selection of Models of DNA Substitution for Multicore Clusters

This paper presents the High Performance Computing (HPC) support of jModelTest2, the most popular bioinformatic tool for the statistical selection of models of DNA substitution. As this task can demand vast computational resources, especially in terms of processing power, jModelTest2 implements three parallel algorithms for model selection: (1) a multithreaded implementation for shared memory a...

متن کامل

ارزیابی بارکاری ذهنی کنترلر های ترافیک هوایی بر اساس فاکتورهای باروظیفه در شبیه ساز کنترل ترافیک هوایی

Background and aim: Air traffic control has known as a complex cognitive task, which requires controller to focus on task for long time. Mental workload plays an important role in the performance of controllers. The aim of this study was to assess the workload of air traffic controller on the basis of task load factors. Methods: The present descriptive-analytical study was conducted among fo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JCS

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2014